NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Examining Spanish Counseling with MIDAS: a Motivational Interviewing Dataset in Spanish

https://doi.org/10.18653/v1/2025.naacl-short.73

Gunal, Aylin Ece; Yi, Bowen; Piette, John D; Mihalcea, Rada; Perez-Rosas, Veronica (January 2025, Association for Computational Linguistics)

Full Text Available
Dynamic Reward Adjustment in Multi-Reward Reinforcement Learning for Counselor Reflection Generation

Min, Do June; Perez-Rosas, Veronica; Resnicow, Ken; Mihalcea, Rada (August 2024, ELRA and ICCL)

In this paper, we study the problem of multi-reward reinforcement learning to jointly optimize for multiple text qualities for natural language generation. We focus on the task of counselor reflection generation, where we optimize the generators to simultaneously improve the fluency, coherence, and reflection quality of generated counselor responses. We introduce two novel bandit methods, DynaOpt and C-DynaOpt, which rely on the broad strategy of combining rewards into a single value and optimizing them simultaneously. Specifically, we employ non-contextual and contextual multi-arm bandits to dynamically adjust multiple reward weights during training. Through automatic and manual evaluations, we show that our proposed techniques, DynaOpt and C-DynaOpt, outperform existing naive and bandit baselines, showcasing their potential for enhancing language models.
more » « less
Full Text Available
How developments in natural language processing help us in understanding human behaviour

https://doi.org/10.1038/s41562-024-01938-0

Mihalcea, Rada; Biester, Laura; Boyd, Ryan L; Jin, Zhijing; Perez-Rosas, Veronica; Wilson, Steven; Pennebaker, James W (October 2024, Nature Human Behaviour)

Full Text Available
VERVE: Template-based ReflectiVE Rewriting for MotiVational IntErviewing

https://doi.org/10.18653/v1/2023.findings-emnlp.690

Min, Do; Perez-Rosas, Veronica; Resnicow, Ken; Mihalcea, Rada (December 2023, Association for Computational Linguistics)

Full Text Available
Learning from Personal Longitudinal Dialog Data

https://doi.org/10.1109/MIS.2019.2916965

Welch, Charles; Perez-Rosas, Veronica; Kummerfeld, Jonathan; Mihalcea, Rada (July 2019, IEEE intelligent systems)

We explore the use of longitudinal dialog data for two dialog prediction tasks: next message prediction and response time prediction. We show that a neural model using personal data that leverages a combination of message content, style matching, time features, and speaker attributes leads to the best results for both tasks, with error rate reductions of up to 15\% compared to a classifier that relies exclusively on message content and to a classifier that does not use personal data.
more » « less
Full Text Available
Look Who's Talking: Inferring Speaker Attributes from Personal Longitudinal Dialog

Welch, Charles; Perez-Rosas, Veronica; Kummerfeld, Jonathan; Mihalcea, Rada (April 2019, Proceedings of the 20th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing))

We examine a large dialog corpus obtained from the conversation history of a single individual with 104 conversation partners. The corpus consists of half a million instant messages, across several messaging platforms. We focus our analyses on seven speaker attributes, each of which partitions the set of speakers, namely: gender; relative age; family member; romantic partner; classmate; co-worker; and native to the same country. In addition to the content of the messages, we examine conversational aspects such as the time messages are sent, messaging frequency, psycholinguistic word categories, linguistic mirroring, and graph-based features reflecting how people in the corpus mention each other. We present two sets of experiments predicting each attribute using (1) short context windows; and (2) a larger set of messages. We find that using all features leads to gains of 9-14% over using message text only.
more » « less
Full Text Available

Search for: All records